Reducing Overheating-Induced Failures Via Performance-Aware CPU Power Management
Abstract
Cluster end-users and administrators have become increasingly aware that large-scale commodity clusters fail quite frequently, and that the main source of these failures is hardware (e.g., processors), with heat as the primary cause. This situation is expected to worsen with even larger-scale clusters powered by faster (and/or multicore) processors. In this paper, we propose a power-management algorithm that addresses heat-related reliability for processors by controlling their clock speeds in a performance-aware manner. This approach is complementary to existing approaches such as exotic cooling and fault-tolerant technologies in that it proactively deals with power and cooling issues before they become a problem. Our preliminary experimental work demonstrates that our approach can easily be applied to commodity processors and can reduce heat generation by 30% on average with minimal effect on performance when running the SPEC benchmarks.
Similar resources
Event-Driven Thermal Management in SMP Systems
Actions usually taken to prevent processors from overheating, such as decreasing the frequency or stopping the execution flow, also degrade performance. Multiprocessor systems, however, offer the possibility of moving the task which caused a CPU to overheat away to some other, cooler CPU, so throttling becomes only a last resort taken if all of a system’s processors are hot. Additionally, the d...
Disk-aware Request Distribution-based Web Server Power Management
This work is concerned with reducing power consumption by cluster-based web servers. We focused on server hard disks, a major source of server power consumption. We started with the modification of Logsim, a simulator for cluster-based web servers. The new simulator, NLogsim, behaves exactly like a cluster-based web server handling requests when they arrive. Based on NLogsim, we exposed the rela...
Disk-aware Request Distribution-based Web Server Disk Power Management
This report presents studies, implementation, and simulation we conducted for the course project of COS518, Fall 2003. The course project was concerned with reducing power consumption by cluster-based web servers. We focused on server hard disks, a major source of server power consumption. We started with the modification of Logsim, a simulator for cluster-based web servers. The new simulator, ...
Coordinating Processor and Main Memory for Server Power Capping
With the number of high-density servers in data centers rapidly increasing, power capping with performance optimization has become a key challenge to gain a high return on investment by accommodating the maximized number of servers allowed by the limited power supply and cooling facilities. Various power capping solutions have been recently proposed for high-density servers and different compon...
Power-aware and Temperature Restrain Modeling for Maximizing Performance and Reliability
The ability to constrain power consumption in recent hardware architectures is a powerful capability that can be leveraged for efficient utilization of available power. We propose to develop power-aware performance models that can predict job performance given a resource configuration, that is, the CPU/memory power cap, the number of nodes, etc. In addition to performance optimization under a f...